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Artificial Intelligence (Al) techniques are used for process- 
ing data having multiple possible values, for example, 
gray-scale values of pixels of an image. This is useful in 
scanning of digitally-coded image data to detect anomalies, 
for example defects in some of a series of manufactured 
products. The preferred processing apparatus has either one 
processing channel or multiple parallel processing channels. 
The apparatus samples or scans incoming signals to produce 
a matrix of scan values, selects subsets of the scan values to 
define samples, extracts at least one feature, such as mean 
image brightness (MM), activity (MA), gradient absolute 
value (MG) and/or gradient direction (MD) from each 
sample, uses the at least one feature as an address to retrieve, 
from a table, a corresponding r^obabiHry-of-occurrence 
value, derives, from the r^bahiHry-of-occurrcnce values of 
a plurality of the samples, a decision value, and compares 
the decision value with a redetermined threshold value. The 
probability-of-occurrence values are stored in the table 
during a 'learning" procedure. 


22 Claims, 11 Drawing Sheets 
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METHOD OF CLASSIFYING SIGNALS, THE INVENTION 
ESPECIALLY IMAGE SIGNALS 

An object of the present invention is to provide an 

This application is a continuation of application Ser No "Artificial Intelligence" method of classifying signals which 

07/907304, filed Jul. 1, 1992 now abandoned. " s respectively represent one of multiple possible amplitude 

values of the signal, which method can readily be adapted to 

CROSS-REFERENCE TO RELATED PATENT different tasks or physical applications, and which is distin- 

APPUCAnON AND LITERATURE guished by as simple a "learning" process as possible. 

COOPER et aUNESTOR European application 0 19 1 407 Briefly, the method of the present invention carries out, in 

A2, publ. 20 AUG. 1986; 10 either one channel or in several parallel channels, the 

PROJECT ALIAS, Phase 1 Final Report, Jan. following processing steps: 

1989-March 1990, Research Inst for Applied Knowledge sampling or scanning incoming signals to produce a 

Processing, Ulm, Germany; (in German: Forschungsinstitut matrix of scan values, 

fur anwewiungsorientierte Wissenvcrarbeitung or EA.W.). selecting subsets of said scan values to define samples, 

FIELD OF THE INVENTION 15 extracting at least one feature from each sample, 

The present invention relates generally to processing of usin 8 said at least one feature as an address to retrieve, 

digital or digitized signals and, more particularly, to pro- firom a table » a corresponding probabiHty-of-<xx;urrence 

cessing of gray-scale image data to detect anomalies in the valuc > 

images. 20 deriving, from the rrcbabilityK)f-c<xiirrence values of a 

BACKGROUND plurality of said samples* a decision value, and 

t« mamr fl»iH« ~f j . . t . comparing said decision value with a predetermined 

In many fields of science and technology, it is necessary threshold value, 

to classify real-value signals into one or more classes. Hie . . . . . 

signals may be one-dimensional, such as acoustic signals „ . Atthough me method 18 ada l >ted to learning and recogni- 

and other time-dependent measured quantities of various * on . ""^P^ classes, the learning procedure for a single 

kinds. Two-dimensional signals are functions of two , " P^o^y ^e, according to the inventive 

variables, such as camera images, image sequences, method. The description of the classes is accomplished using 

projections, and acoustic signals in the form of sonograms. , y exam P lcs f which faU within the respective 

Hnally, it is possible to classify n^Smensional signals ^ class - learning process of the present invention extracts, 

which are functions of n variables. from the data, the rules necessary for classification. The data 

THe previous suggestions for classification of signals have ^ * ^ ** '^k' *** ^ 1CamCd 

often been directed to binary signals, that is, signals which ^ uclu y- 

can assume only two states or levels, such as black-white Exemplary applications of the inventive method are, 

images. Such a system is disclosed in the research report 35 amon S others, the recognition of defective products in a 

"PROJECT ALIAS — Phase 1 Final Report, Jan. Paction process (automatic quality control or "reject" 

198*-March 1990," of Research Inst for Applied Know]- removal), the recognition of objects in images of natural 

edge Processing, Ulm, Germany, published May 16, 1990; scenes, and the recognition of anomalies in time series, 

the system uses multiple parallel channels to extract features The signals to be classified by means of the present 

from a stream of binary signals. ^ invention can be generated by a single sensor, e.g. a video 

Thus, in classifying images with multiple gray levels, one camera or CCD (Charge Coupled Device) scanner, or by 

often carries out, prior to the actual classification, a com- multiple sensors which observe the same process or object 

parison with a threshold value, in order to obtain binary 7 ne inventive method is particularly well adapted for solv- 

images adapted for processing by a binary classification pattern recognition problems involving mmtidimen- 

system. However, this often results in loss of important 43 sional signals. It is in a position, given good training 

information. examples, to independently extract their typical features and 

COOPER et al/NESTOR, INC European Patent Appli- to lcarn th<m 11 is P° ssible to tr ^ n a system, equipped to 

cation 0,191,407 A2, published 20 AUG. 1986 and corre- P^ce the inventive method, with cither one class or 

sponding to U.S. Ser. Nos. 702,188 and 775,144, filed 1985, multi P lc classes, each class and each channel generating a 

discloses another prior art system with multiple classifica- 50 

tion channels, whose output signals are combined to reach a The probabiliry-of-occurrence values, which are stored in 

classification result In the individual channels, the signals to tbe table, need not correspond exactly to the previously 

be classified are compared with prototypes, which represent conveyed values for the probability of occurrence; the stored 

their class as a hypersphere about their position in partem values need only to represent a monotonic function of the 

space. The radius of this hypersphere is adjusted using a 55 probability of occurrence. 

learning algorithm: According to a training phase, pattern A refinement of the basic method of the invention is to use 

vectors are associated with test signals of a particular class, multiple processing channels and, prior to further 

to which the prototype also belongs, and in whose hyper- evaluation, to combine with one another the probabilities of 

sphere they fall. The classification is specified using a occurrence retrieved from the tables of the respective chan- 

spacing measurement, namely the spacing from the input 60 Dels. This provides a broader possible field of application for 

pattern vector to the prototype vector. Extensive overlapping the method of the invention. 

of various classes in pattern space leads in a so-called One can carry out the method of me invention by spacing 

"confusion zone" to a rniimplicity of prototypes with very the samples from one another by a sampling interval a, 

small "regions of influence"; these can lead to substantial where a is an integer whose value is between 1 and the 

problems in the capacity to generalize the system. Further, 65 number of temporally sequential sample or scan values 

many runs in the training phase are needed to define the which define a sample. For example, given 9 values, a is 

* rotot yP es - between 1 and 9. The choice of the size of the samples, and 
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of the interval a permits tyatrfwng or adjustment of the the differences between contiguous pixels of the same - 

inventive method to various tasks. sample), Moment, Gradient (Magnitude and Direction), 

Another refinement of the basic method of the invention Texture, Variance, Fractal Dimensions, Derivatives, Rela- 

is to extract in each processing channel, multiple features or tive or Absolute Position of a Sample, and Temporal or 

patterns, which together define an address for retrieval from 5 Spatial Combination with at least 1 other sample, 

the table. In this manner, it becomes simple to factor in the During extraction of features, there occur regularly values 

combined probability of occurrence of the extracted fea- with relatively high precision, that is, a large number of 

tares. In a practical embodiment of the method of the binary places. If one uses such values as addresses for the 

invention, this refinement must also accommodate itself to tables of probability of occurrence, during the learning 

the structure of digital signal processing (DSP) systems, io process, one needs, for statistical reasons, a very large 

which typically operate with an address space of 16 binary number of different samples and different signals, in order 

places. Since a finer quantization of the features is often that a reliable value for the probability of occurrence can be 

neither necessary nor practical, the formation of a common stored under each individual address of the table. In order to 

address for multiple features utilizes the aforementioned meaningfully reduce the number of various samples and 

address space effectively. 13 signals, a further refinement of the invention provides that 

A further increase in the flexibility of the method of the the features used for formation of the respective addresses 

invention is made possible if the address also contains are scaled and/or quantized, ie, approximated to a consistent 

supplemental information (Z) which is not derived by number of binary places. 

extracting features from the samples. This supplemental By "quantization," we mean me subdivision of the respec- 

information can contain, far example, details about the 20 tive feature into coarser quantized levels, which in its 

combination and/or weighting of features contained in the simplest form can be carried out by leaving off or truncating 

same address, about circumstances during generation of the least significant binary places. By "scaring,- we mean pri- 

signals, and/or details relating to the internal state of the manly use of non-linear characteristic curves, so that for 

classification apparatus. Further, the supplemental informa- example, the greater part of a value range is associated with 

ton can be derived from processing steps subsequent to the 25 a single address, while the remainder of the value range is 

feature extraction. quantized over the remaining possible values or addresses. 

Preferably, one can provide that, in the individual The method of the invention operates completely 

channels, processing occurs with samples of differing sizes translation-invariant This means that the detenriination of 

and/or involves extraction of different features or patterns. the probability of occurrence of a pattern does not depend 

With these differences between the individual channels, the 30 upon the position of the pattern in the signal. This invariance 

handling of manifold and complex assignments is possible. can be partially or completely dispensed with, if the absolute 

One may also provide for other differences among the position, or the position relative to a reference point of the 

individual channels. To the respective channels, one can pattern is chosen as a feature. 

direct signals of different sensors which observe the same For this purpose, during the classification of images using 
process or the same object the inventive method, one can employ supplemental 
Another refinement of the basic method of the invention information, contained in the address, in an advantageous 
is, when combining the probability values retrieved from the way, namely by having the supplemental information con- 
respective tables, to first combine the probability values tain data about the position of the respective sample within 
retrieved, based upon the features of temporally or spatially ^ the image. 

corresponding samples, then to rairaii«ti> from the combined An advantageous learning procedure, in the context of the 

probability values a decision value, and finally to compare inventive method, is for signals to be fed during a learning 

the decision value with a predetermined threshold value. procedure, which signals fall into the class or category to be 

This procedure results in a particularly advantageous com- learned, and for the following processing steps to be carried 

bination of the results of the individual channels. 45 out in a single channel or in parallel in multiple channels: 

In order to suppress noise components of the signals, one from multiple scan values, samples are chosen or defined; 

can provide filtering of the signals prior to feature extraction from cacn sample, at least one feature is extracted; 

and/or filtering of the probability values retrieved from the ^ feamrc? OT me fcatures of respective samples, are used 

table or tables. ^ addresses, so that when a feature occurs, a value 

In a particularly advantageous enu>odirnent of the 50 previously stored, at the address associated with the 

invention, if each sample is an array of pixels which is m feature, is incremented. 

pixels horizontally and n pixels vertically, and the scan or Thereby, one can provide that a non-linear weighting of the 

sampling values represent characteristics (such as mono- frequency or "oftenness" distribution is carried out and/or 

chrome gray-scale values or colors), and the samples are mat ^ <mr ing incrementation, further samples are factored in 

taken successively, line-by-line, the offset from one sample 55 OT taken mt0 consideration. 

to the following one along the horizontal fine is a distance A refinement of the invention is to display, on a 

of between 1 and rn pixels wide, and the offset in the CRT or LCD or plasma screen or the like, the probability of 

perpendicular direction (e.g. vertically) is a distance occurrence data retrieved from the table and/or combined, 

between 1 and n pixels high. Such a depiction can be used for verification of the com- 

WUh this embodiment, the classification of various image 60 pleted classification, which regions have contributed to the 

formats is possible. For example, in the field of quality decision and how. This is another advantage over other 

control, the surfaces of a series of industrial products can be learning procedures, which mostly permit no localized veri- 

monitored for anomalies, eg. scratches and striations. fi cation of the decision. 

Preferably, in this embodiment m=n, which means a qua- DRAWING 

dralic or square sample. 65 

Features which the method is well adapted to extract FIG. 1 is a schematic representation of the method of the 
include: Mean Value, Activity (Sum of the absolute values of present invention; 
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FIG. 2a is a block diagram representing a channel dining method step C to define a superfaypothesis SH. from this, a 

classification of signals while the equations of FIG. 2b show further step E calculates a decision value, which subsc- 

the calculations which computing units MM, MA, MG and quently is compared in step S with a threshold value. The 

MD of FIG. la perform; result R is available at the output as a determination of the 

FIG. 3 is a block diagram representing a channel during 5 class membership or categorization, 

a 'learning** step; FIG. 2 is a more detailed block diagram of one of the 

FIG.4isablc<fccyagramshowmgawayo channels Kl through K32 of FIG. 1. An input 1 is connected 

results of various channels; to a filter F (shown here as a low-pass filter), whose output 

FIGS. 5Ato 5R are a series of screen displays illustrating lfl {■ connected to a circuit P for breaking down the input 

the results of various method steps for a first Image to be 10 mt0 sam P les - Grant P consists essentially of a 

classified: read-write memory 2 with a data input D, and a data output 

, ^ „ , . . D„, as well as an address generator 3 which can be pro- 

HGS. 6A and 6B are a series of screen displays during g ^ amscd by fcd mt0 its j^t 4. Input 4, as well as 

classification of a further image; and further inputs 5-9, shown at the top of FIG. 2a, serve for 

FIGS. 7A to 7G are a series of screen displays during use 15 feeding of control signals generated by a control unit* not 

of the method of the invention for industrial quality control. shown, which direct the individual circuits to perform vary- 

Idrntical elements shown in different figures are desig- ing functions, depending upon the respective task and the 

nated with the same reference numeral in each figure. respective operating state. 

DETAILED DESCRIPTION Read-write memory 2 has a capacity of 256x256 values. 

x/cisuuLu uouuriiui 20 Fof simplicity of illustration, FIG. la shows only memory 

For carrying out the method of the present invention, locations for an 8x8 array of sample values. For recording 

suitable computers are those with a corresponding process- of the filtered original image, the address generator 3 is so 

ing capacity. Due to the relatively large number of partially controlled mat the address for read- write memory 2 is 

parallel operations, multiple computers, processors, incremented line-by-line, that is, in the sequence al, a2, . . 

transputers, and the like can be used. The FIG. 1 represen- 23 . a8, bl, b2, . . . b8. 

tation of the inventive method can thus be seen as a For formation of samples, address generator 3 is pro- 
suggestion for a suitable program, or as an example of a grammed so that the respective scan values of a sample are 
structure of individual circuit groups. In the sense of a sequentially read out over data output D 0 and fed to the 
concrete or liardware" representation, FIGS. 2 through 4 subsequent or downstream circuit M. Given a sample of 9 
are directed to circuits per se. However, a person skilled in 30 image elements or pixels, as shown in FIG. 2a, the readout 
the data processing art could readily use these circuit dia- begins with pixel al, followed by a2 and a3, then pixels bl, 
grams to develop suitable "software" programs to perform b2, and b3, then finally pixels cl, c2, and c3. If the sample 
the functions. with the foregoing image elements is processed in the 

The method depicted in FIG. 1 starts from an original downstream circuit M, next a one-pixel rightwardly trans- 
image • which, for example, is already in digital form and feted sample with the image elements indicated by the 
consists of a 256x256 array of image elements or pixels, dashed border is read out Further samples shown at top right 
each having a gray scale value represented by an 8-bit follow, until the dash-dotted sample is read out Thereafter 
dataword. For signal processing, 32 channels Kl through follow a further series of samples, starting from image 
K32 are provided. Depending upon the task to be performed, element or pixel bl. The top-left image element of a sample 
one could also use a smaller or larger number of channels. will be called the "origin" of the sample in the following 

At the input of each channel, a respective filter Fl through discussion. 
F32 is provided, with which components, of original image As previously mentioned, in the method of the present 
t, unsuited for classification, are suppressed. Individually, invention, one can process, in the different channels, 
for example, one could suppress {redetermined frequencies, ^ samples of differing sizes. One could also provide varying 
especially with a low-pass filter, or one could carry out an numbers of steps from sample to sample, 
undersampling. Frequently, one can also apply a 44 window w ^ mc hardware embodiment of FIG. 2a, circuit M con- 
function, according to which only the image elements within tains four computing units, which are programmable via 
a predetermined window, or range of values, get passed respective inputs 5 through fc Computing unit MM has, in 
along fee further processing. x the example illustrated, the function of finding the mean or 

After the filtering, formation of samples PI through P32 average value of the scan values of a sample, computing unit 

is carried out Details are set forth below in connection with MA serves for calculation of the activity within the sample, 

FIG. 2. Prom each sample, a respective element Ml through computing unit MG determines the absolute value of the 

M32 extracts features, for example four features per sample. gradient, and computing unit MD determines the direction 

/ In order to avoid an excessive number of addresses, a 55 of the gradient within the sample. The equations, according 

respective stage Ql through Q32 quantizes the features, e.g. to which the respective computing units operate, are set forth 

by truncating several least significant binary places. The in FIG. lb. 

' — -individually quantized features are assembled, in a re spec- The results of the feature extraction arc quantized in 1 

bve vector stage VI through V32, into a feature vector. Each circuit Q. Depending upon the kind and significance of the I 

feature vector is fed to a respective table Tl through T32, in 60 individual features, a suitably fine quantization can be~^ 

which there is also recorded, for each vector, an associated carried out The results are assembled at V into a feature 

probability of occurrence, derived from a respective learning vector MV. In the example, the features Mean Value, 

stage LI through L32. Activity, Absolute Value of Gradient, and Angle of Gradient 

The probabilities of occurrence retrieved for each image are each quantized to 3 binary places, so that, of a total of 

are available as a matrix, corresponding to the origins or 65 16 binary places, 4 binary places remain for supplemental 

sources of the individual samples, and form a respective information Z. In the example shown in FIG. 2, these 

hypothesis HI through H32. These are combined in a further remaining bits are used to add location information to the 
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feature vector, which is accomplished using the address 
generator. Further, as shown at the right side of FIG. 2a, the 
control unit (not shown) can use input 9 to set one binary 
place of supplemental information Z arbitrarily, as desired 
The feature vector MV and supplemental information Z 
are fed to a read- write memory 10 as an address. Memory 10 
stores, for each address MV+Z, a value which was derived, 
in a learning process, from the probability of occurrence of 
the respective feature vector MV. This value can directly 
represent the probability of occurrence, but alternatively, can 
also represent a monotonic function of the probability of 
occurrence. 

This value will henceforth be referred to as "Normality" 
since it represents a standard of the extent to which the 
respective feature vector MV is "normaT for an image fed 
during the learning process. 

For each sample, in dependence upon the associated 
address MV+Z, a Normality value is retrieved from read- 
write memory 10. This value is stared in a further read-write 
memory H (FIG. 2a), at an address corresponding to the 
origin of the sample. This address is fed from address 
generator 3 to the address input A of memory H. By 
line-by-line retrieval addressing of memory H, one can read 
out the Normalities line-by-line at data output 

Before proceeding to a discussion of the processing of a 
hypothesis (or hypotheses in the case of multiple channels), 
an explanation of the "learning" process is in order. 

FIG. 3 illustrates selected portions of a channel, which are 
involved in the learning process or are controlled other than 
in accordance with the classification. The generation of 
feature vector MV and of supplemental information Z 
occurs in the same manner as during the classification itself. 
Illustration of circuits F, P, M» Q, and V has therefore been 
omitted A memory for hypothesis H is not required in 
connection with the learning process. 

In addition to a read-write memory T for the Normalities, 
the arrangement according to FIG. 3 includes a computing 
element 21 and a control apparatus 22, which actuates 
switchover between a Write mode and a Read mode, as 
indicated at input W/R of memory T. Pries' to the beginning 
of the learning procedure, the content of the read-write 
memory is erased As soon as an address MV+Z is present 
for a sample, the content of the corresponding cell of 
read-write memory T is read out, and modified in accordance 
with an instruction previously applied to a control terminal 
23 of computing dement 21. In the simplest case, this 
modification consists of incrementation, so mat the normal- 
ity is directly proportional to the frequency of occurrence of 
the corresponding address. 

The computing element can, however, apply other 
functions, particularly non-linear ones. The result of the 
calculation is written back into read- write memory T at the 
same address. Then the address derived from the next 
sample is fed, and the previously stored Normality is cor- 
respondingly modified 

FIG. 4 illustrates an example of the combination of the 
hypotheses, derived in the individual channels, into a 
superhypothesis, of the calculation of a decision value, and 
of the evaluation of the decision value. 

The hypotheses of the individual channels are read out of 
the respective read-write memories HI through H32, in 
which they were previously stored in accordance with FIG. 
2. For generation of the addresses, there is an address 
generator 25, which feeds the same addresses to the address 
inputs A of each respective read-write memory HI through 
H32. Thus are applied simultaneous, location ally corre- 
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spending Normality values to inputs of a computing unit 26, 
which is programmable, via a control input 27, to carry out 
various calculations. One can, for example, derive the mini- 
mum value or the maximum value or the mean value of the 

5 Normality values fed in, The result is written, on the one 
hand, to a further read-write memory 28 while using the 
same addresses, and is available there, for example for 
readout as a screen image representation. 

In order to form a decision value, according to the 

io ernbodiment of FIG. 4, the output values of computing unit 
26 are added up in a further computing unit 29. Thus is 
formed a decision value, which is fed to a threshold value 
circuit 30. Each of computing unit 29 and threshold circuit 
30 is controllable, over a respective control input 31, 32. 

13 The more features, of an image to be classified fall into 
the 'feature region** created during the learning procedure, 
the more Normalities of the hypotheses and of superhypoth- 
eses assume greater values in the region of their upper limit 
values. Correspondingly, their sum, i.e. the decision value, 

20 becomes larger. In accordance with a threshold determined 
for each respective application, one can decide whether the 
image to be classified falls into the learned class or not 
Thus, at output R, a corresponding digital signal is made 
available. 

25 FIGS. 5A to 5R show using multiple screen images, the 
signals arising during the various processing steps, subject 
to the following conditions: in a previous learning 
procedure, fractals were learned, since such images are well 
suited for experimentation and demonstration purposes. 
Four channels were used, in each of which the features Mean 
Value, Activity, and Gradient Absolute Value were extracted 
The step width in all channels during formation of samples 
was 1, while the sizes of the samples vary from channel to 
channel. The individual images are designated according to 
the method step (FIGS. 1-2) through which they were 
created 

The original image according to FIG. 5A represents a 
fractal which falls into the learned class, but which contains 

^ as an anomaly, the word "ANOMALY." FIG. 5B represents 
the superhypothesis from which, in a simple manner, as 
described in connection with FIG. 4, the result R can be 
computed It is apparent that the surface components, and 
thus also the number of Normalities with a low value (in the 

45 representation as an image, the number of dark pixels or 
image elements) clearly exceeds the fraction of further 
Normalities of low value distributed across the image. 

These further Normalities of low value scattered across 
the image arise essentially through an incomplete learning 

50 process. During practical application of the method for 
economic reasons, the learning procedure is not driven so 
far, that all conceivable images to be classified with all 
Normalities reach the maximal value. The threshold at 30 
(FIG. 4) is set correspondingly. 

55 Should it be useful, one could for further evaluation of 
the superhypotheses, draw upon other known image pro- 
cessing methods which, for example, suppress solitary- 
occuring Normalities of low value (individual black points 
in FIG. SB) and consider only greater aggregations. 

60 As the images in FIG. 5A and FIG. 5B show, the anoma- 
lous ness within the superhypothesis impacts the observer 
substantially more strongly than in the original image. 
Therefore, in this connection, it is pointed out that, depend- 
ing upon the use or application, the method of the invention 

65 is advantageously usable with the formation of one or 
multiple hypotheses or superhypotheses. On the one hand 
for example, in scientific research, the anomalies of an 
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image can be easily observed without carrying out a fully bits In the whole value range from 09b to 360%, while the 

automatic classification. On the other hand, the observation Magnitude is, as in Channel 1, linearly quantized using 8 bits 

of "intermediate images" during automation applications, only in the range from 0% to 6%. The resulting feature 

eg. during the learning procedure or in a test phase required vector is again used far addressing of the Normality table of 

by the learning procedure, is provided for, even though one 5 toe channel. 

would prefer a fully automatic classification during regular The range hypotheses produced in this manner (FIG. 7C 

operation. and 7D) result through the medium of the superhypotheses 

The images of FIDS. 5C through 5F show the results of ^ from whose n*** Normality value the decision 

Mean Value formation in channels Kl through K4 with valu f is , emulated. If the decision value lies above a 

samples of different sizes. The larger the samples are, the 10 P™"?* dctecrmned or specified threshold, the input 

more details are traiisinitted, SoSng simto applies for J"* 18 dassificd 85 g(X>d ; lt 15 dasafied as 

details of fractals according to Activity, which is illustrated „ . . . ... ^ . , , . 

. - ;„ n „ m „ ct^o cv 4i v ew A — i • _ Various changes and modifications, and additional physi- 

in the images erf FIGS. 5K through 5N.Aj sample sues ^ Hcatiolls f „ m£ withln ^ ^ toe ^ 

increase, the individual values of the activities equalize For tive concept 

™^Z^^ A suitable memory 2 is model no. IDT7006 made by the 

result high Activity values. firm integrated Device Technology. For 64K-Byte inemory 

Images of FIGS. 5G through 51 are formed from the 4 m needed. 

Absolute Values of the Gradients. In the small samples in a suitable address generator 3 is model no. IMS T800 

channel 1, (image 5G\ the written stroke comes across made by the firm INMOS. The IMS T800 is a 32-Bit 

clearly, whereas in larger samples, little differentiation 20 microprocessor, which is part of the transputer family made 

between the written stroke and the fractal is seen. by INMOS (now part of SGS-Thomson). 

The individual features represented in the images of The address generation for memory 2 can /or example, be 

FIGS. 5C through 5F, 5K through 5N, and 5G through 5J, done with an OCCAM- program such as the following 

form in each chancel an address with which one can read out ( — OCCAM is a specific programming language for 

from the table a Normality value, which leads to the hypoth- 25 transputers—): 

eses shown in images of FIGS. SO through SR. As a Di=data input for memory 2 

consequence of the previous learning procedure, the images Da=data -output for memory 2 

of FIGS. SO through 5R exhibit, at the places, whose content A?=address input for memory 2 

is determined exclusively by the fractal, essentially white sampleLengthY=length of sample mY-direction (from input 

image elements, Le. maximal Normality. Since the anomaly 30 \ _ . ^ _ , . __ „ . 

is not learned, it leads to black pixels, Le. to very low ^^ngthX=4ength of sample in X-direction (from 

Normalities. readimage data 256x256 

The example illustrated in FIG. 6A and 6B starts from an SEQ i=0 FOR 256 

original image representing a fractal which has noise super- 3J §gQ i_q p^ R 255 

imposed on it at two places. From the superhypothesis SEQ 

depicted in FIG. 6B, it is apparent that the associated A^<256*iVH 

Normality values are high for the undisturbed image regions YalueIn=I>i 

and low for the disturbed or noisy image regions. By means data(A)*=valueIn 

of the calculation, represented in connection with FIG. 4, of ^ write sample data 256x256 

a decision value and the subsequent comparison with a « pri ^ Pno n * n eavnw<lgtl ^^v^n, ,.«f~ 

threshold value, the presence T a cUstmW or anomaly KTf^S 

can easily be detected here also. ^2 (^-sampleLengthX^olumn- 

FIGS. 7A to 7G illustrate a successful application of the SEQ m=0 FOR sampleLengthY 

inventive method to industrial quality control Images of 43 SEQ n=0 FOR sampleLengthX 

metallic surfaces are examined and, after a training run of SEQ 

100 good images, the surface images are classified as either A^=(((256*k)+I>f(256*ni)}fn 

"good" or "bad." For this application, two channels were valueOut?=data(A) 

used. Each original image (FIG. 7A) is fed separately to each Dor^valueOut 

of the two channels, and there broken down, without 50 Generating the 3-Bit location information for the feature 

filtering, into overlapping samples, sized 6x6 pixels. vector V can /or example, be done with an OCCAM- 

Channel 1 extracts, from these samples, the two features program such as the following: 

"Middle or Mean Value" (FIG. 7G) and "Gradient Magni- Zloc is the additional location information in Y-direction for 

hide" (FIG. 7F). The value of each of these features falls in the feature vector V 

a range between 0% and a maximum value (100%). During 55 k is the row-counter in the write sample data 256x25c?- 

feature quantization, the Gradient Magnitude is linearly routine above and is 

quantized in a range between 0% and 6% of the maximum the location information in Y-direction Zloc:=k/32 

value using 8 bits (mat is, 2*=*256 steps or levels). The Mean The address generation for memory H can jtx example, be 

Value is linearly quantized, using 8 bits, in a range between done with an OCCAM- program such as the following: 

0% and 45%. Feature values, which are larger than the upper 60 AdrH=address input for memory H 

boundary or limit, are assigned the top quantizing leveL By fc is the row-counter in the write sample data 256x256- 

concatenating (stringing together) the quantized feature routine above 

values, a "feature vector** of 16-bit-length is generated, I is the column-counter in the write sample data 256x256'- 

which addresses the Normality table of the channel. routine above AdrH:=<256*kyn 

Channel 2 extracts from the samples, the two features 65 A suitable memory 10 is model no. IDT7006 made by the 

"Gradient Magnitude" (FIG. IF) and "Direction of Gradi- firm Integrated Device Technology. For 64K-Byte memory 

enf (FIG. 7E). The direction is linearly quantized using 8 4 chips are needed. 
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. A suitable memory H is model no. IDT7006 made by the 
firm Integrated Device technology. For 64K-Byte memory 
4 chips are needed. 

We claim: 

1. Method for classification of signals, which signals each 
have more than two possible amplitude values, each 
sequence of signals collectively defining image background 
information and a feature, using data processing means 
having at least one processing channel (K1-K32), compris- 
ing the steps of: 

carrying out a learning process which generates and 
stores, at respective memory addresses in a memory of 
said data processing means, a table consisting of a 
plurality of probability -^-occurrence values of respec- 
tive features, each occurrence of a particular feature 
during said learning process causing incrementing of 
the stored probability value of the feature; 

sampling incoming signals to produce a matrix of scan 
values; 

selecting subsets of scan values to define samples; 

extracting at least one feature from each sample; 

using said at least one feature as an address to retrieve, 
from said table, a corresponding one value from among 
said plurality of probability-of -occurren ce values; 

deriving from said one probability-of-occurrence value 
for each one of a plurality of said features respectively 
corresponding to a plurality of said samples, a decision 
value; 

comparing said decision value with a predetermined 
threshold value for classifying the signals; and 

subjecting said signals to a window function prior to said 
step of extracting features from said samples, said 
window function including the step of passing only 
ones of said samples within a predetermined range of 
values for further processing by said step of extracting. 

2. Method according to claim 1, wherein 
a plurality of processing channels are used; and 
said decision-value-deriving step includes electronically ^ 

combining the probability-of -occurrence values of all 
of the samples defined in said selecting step. 

3. Method according to claim 2, 
further comprising 

obtaining said incoming signals by imaging each of a 

series of objects; and 
physically separating said objects into at least two groups, 

based upon whether said decision value for each object 

exceeds said threshold value. 

4. Method according to claim 2, wherein 
respective channels extract respectively different features 

from said samples. 

5. Method according to claim 2, wherein 
respective channels process samples of different respec- 
tive sizes. 

6. Method according to claim 1, wherein 
a plurality of channels are used; and 
a plurality of features are used to generate each address 

for retrieval from said table; 
said retrieved probability>of-occurrence values, from 
respective ones of said plurality of channels, are com- 
bined to derive said decision value, which is then 
compared with said predetermined threshold. 

7. Method according to claim 1, wherein 
said samples are temporally spaced from each other by 

about a scan period a, which has an integer value 


between 1 and the number of values which collectively 
constitute each sample. 

8. Method according to claim 1, wherein 
said extracting step comprises extracting a plurality of 

features in each channel, and 
said features are used together to define said address in 
said table. 

9. Method according to claim 4, wherein 
said address includes supplemental data (Z) derived inde- 
pendently of features extracted from said samples. 

10. Method according to claim 9, wherein 
said supplemental data (Z) includes information concern- 
ing relative weighting of features contained in the same 
address. 

11. Method for classification of signals, which signals 
each have more than two possible amplitude values, each 
sequence of signals collectively defining image background 
information and a feature, using data processing means 
having at least one processing channel (K1-K32), compris- 
ing the steps of. 

carrying out a learning process which generates and 
stores, at respective memory addresses in a memory of 
said data processing means, a table consisting of a 
plurality of probability-of -occurrence values of respec- 
tive features, each occurrence of a particular feature 
during said learning process causing incrementing of 
the stored probability value of the feature; 
sampling incoming signals to produce a matrix of scan 
values; 

selecting subsets of scan values to define samples; 
extracting at least one feature from each sample; 
using said at least one feature as an address to retrieve, 
from said table, a corresponding one value from among 
said plurality of probabflity-cf -occurrence values; 
deriving from said one probability-of-occurrence value 
for each one of a plurality of said features respectively 
corresponding to a plurality of said samples, a decision 
value; 

comparing said decision value with a predetermined 
threshold value for classifying the signals; and wherein: 
a set of prototype signals which represent an image 
feature each have a defined position in a pattern 
space; 

a hypersphere about each prototype signal's position 

defines a respective class of similar images; and 
for each class, a number of said probability-of- 
occurrence value tables is provided, which number 
corresponds to the number of processing channels 
used in said data processing means. 

12. Method according to claim 11, 
further comprising the step of 

filtering said probabiUty-of-occurrence values retrieved 
from said table. 

13. Method far classification of signals, which signals 
each have more than two possible amplitude values, each 
sequence of signals collectively defining image background 
information and a feature, using data processing means 
having at least one processing channel (K1-K32), compris- 

€0 ing the steps of: 

carrying out a learning process which generates and 
stores, at respective memory addresses in a memory of 
said data processing means, a table consisting of a 
plurality of probabm'ty-of-occurrcnce values of respec- 
tive features, each occurrence of a particular feature 
during said learning process causing incrementing of 
the stored probability value of the feature; 
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sampling incoming signals to produce a matrix of scan 
values; 

selecting subsets of scan values to define samples; 

extracting at least one feature from each sample; 

using said at least one feature as an address to retrieve, 
from said table, a corresponding one value from among 
said plurality of probability-of -occurrence values; 

deriving from said one rjrobabiUty-of-c<xanTence value 
for each one of a plurality of said features respectively 
corresponding to a plurality of said samples, a decision 
value; 

comparing said decision value with a predetermined 

threshold value for classifying the signals; 
measuring at least one of color and gray-scale brightness 
of pixels of an image; and wherein: 
each of said samples is an array of mxn pixels arranged 

in a line-and-column rectangular raster pattern; 
an offset along a line between successive samples is an 

integer number of pixel-widths between 1 and m; 
the offset perpendicular to said line between successive 

samples is an integer number of pixel-widths 

between 1 and n; and 
said features used for address generation are processed 

using a non-linear characteristic curve. 

14. Method according to claim 13, wherein m=n. 

15. Method according to claim 13, wherein 

said feature extracting step involves extracting at least one 

of the following features: 
Mean Value of the sample values; 
Activity (sum of the absolute values of the differences 

between contiguous pixels of the same sample); 
Moment of the sample values; 

Gradient (Magnitude and Direction) of the sample values 

within a sample; 
Texture of the sample values; 
Variance of the sample values; 
Fractal Dimensions of the sample values; 
Derivatives of the sample values; 
Relative or Absolute Position of a Sample; and 
Temporal or Spatial Combination with at least 1 other 

sample. 

16. Method according to claim 13, wherein said address 
includes not only said features but also supplemental infor- 
mation (Z). 

17. Method according to claim 16, wherein said supple- 
mental information (Z) includes an indication of position of 
said sample within said image. 

18. Method according to claim 13, 
further comprising 

displaying on a screen said probability-of-occurrencc 
values retrieved from said table. 
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19. Method according to claim 13, wherein said features 
used for address generation are truncated to a predetennined 
number of significant binary places. 

20. Method for classification of signals, which signals 
each have more than two possible amplitude values, each 
sequence of signals collectively defining image background 
information and a feature, using data processing means 
having at least one processing channel (K1-K32), compris- 
ing the steps of: 

carrying out a learning process which generates and 
stores, at respective memory addresses in a memory of 
said data processing means, a table consisting of a 
plurality of probability-of -ocexnrence values of respec- 
tive features, each occurrence of a particular feature 
during said learning process causing incrementing of 
the stored probability value of the feature; 

sampling incoming signals to produce a matrix of scan 
values; 

selecting subsets of scan values to define samples; 

extracting at least one feature from each sample; 

using said at least one feature as an address to retrieve, 
from said table, a corresponding one value from among 
said plurality of probabiliry-cf -occurrence values; 

deriving from said one probability-of-occurrence value 
for each one of a plurality of said features respectively 
corresponding to a plurality of said samples, a decision 
value; 

comparing said decision value with a predetermined 
threshold value for classifying the signals; 
modifying said probabfliry-of -occurrc nee values of 
respective features stored in said memory in accor- 
dance with a frequency of occurrence of each 
address of said memory to increment a value previ- 
ously stored at said address, in response to said step 
of using; and 
wherein: 

said mcorning signals have more than one possible 
amplitude value, and 

the steps of sampling, selecting, extracting and using 
are carried out in parallel, in a plurality of chan- 
nels. 

21. Method according to claim 29, 
further comprising 

deriving a frequency^f-occurrence distribution; and 
subjecting said distribution to a non-linear weighting. 

22. Method according to claim 20, 
further comprising 

using features of other samples as a factor in performing 
said incrementation. 
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